Instant messaging method and system, communication information processing method, terminal, and storage medium

ABSTRACT

The present disclosure provides an instant messaging method and system, a communication information processing method, a terminal, and a storage medium. The instant messaging method includes: receiving, by a first terminal, a sound recorded by a user, and performing sound change processing on the sound recorded by the user to provide a changed sound; sending, by the first terminal, the changed sound after the sound change processing to a second terminal for the second terminal to synthesize the received, changed sound that has gone through the sound change processing with a pre-stored animation, so as to form analog image data and to play the analog image data. The present disclosure has advantages including rich communication forms, convenient operation, and high network transmission efficiency.

RELATED APPLICATION

This patent application is a continuation of PCT Application No.PCT/CN2013/086132, filed on Oct. 29, 2013, which claims priority toChinese Patent Application No. 201210510615.X, filed with the ChinesePatent Office on Dec. 4, 2012, the entire contents of all of which areincorporated by reference.

FIELD OF THE TECHNOLOGY

The present disclosure generally relates to the field of Internettechnologies and, more particularly, relates to an instant messagingmethod and system, a communication information processing method, aterminal, and a storage medium.

BACKGROUND OF THE DISCLOSURE

With the rapid development of science and technology and incessantchanges in communication technologies, mobile communication terminalshave been widely applied in daily life and used by people, and peoplemay call each other or send short message service messages to each otherwith a mobile communication terminal and via a network, which greatlyshortens the distance between people and further brings greatconvenience for connections and provides information exchange betweenpeople.

Currently, communication manners of network chatting include use oftext, an emoticon (including a system-contained emoticon and auser-defined emoticon), a picture (including graffiti), a voice message,a text message, and a video (including video chatting and offlinevideo). Although there are numerous forms for communication, there isstill a certain limitation unable to fully meet needs of a user. Thetext is a most widely used chatting manner, but its expressive form issimplex and hard to express a real feeling and mood of a user at thattime, thereby causing that misunderstanding frequently happens duringchatting. Same as the text chatting manner, the voice message and thetext message also have the disadvantage as a simplex expressive form.Uses of the picture and the emoticon may enrich expression of useremotion; however, during use of the picture or the emoticon, a useroften needs to search a large number of pictures or emoticons for apicture or an emoticon that suits a current state of the user, whichresults in inconvenient operation. The video chatting can truly presentvideos of both chatting parties; however, transmission of video dataneeds to occupy large bandwidth and consume a long sending time and hightraffic costs, and in a case in which a network state is not good or thebandwidth is narrow, video effects are unsatisfactory.

Therefore, there is a need to solve technical problems in the Internetand computer technology to provide improved chatting manner.

SUMMARY

Objectives of the present disclosure are to provide an instant messagingmethod and system, a communication information processing method, aterminal, and a storage medium, so as to solve problems of a simplexexpressive form, inconvenient operation, and large traffic of anexisting instant chatting manner.

The present disclosure provides an instant messaging method, including:receiving, by a first terminal, a sound recorded by a user, performing,by the first terminal, sound change processing on the sound recorded bythe user to provide a changed sound; and sending, by the first terminal,the changed sound to a second terminal for the second terminal tosynthesize the changed sound with a pre-stored animation in the secondterminal to form analog image data and to play the analog image data.

The present disclosure further provides a communication informationprocessing method, including: receiving a changed sound after soundchange processing from a to-be-communicated party; synthesizing thechanged sound with a pre-stored animation, so as to form analog imagedata; and playing the analog image data.

The present disclosure further provides an instant messaging system,including at least two terminals connected to each other via a network.Each terminal includes: a storage module, configured to store multipleanimations; a sound recording module, configured to receive a soundrecorded by a user; a sound change module, configured to perform soundchange processing on the sound recorded by the user to provide a changedsound; a communication module, configured to send the changed sound toanother terminal or receive a changed sound from another terminal; asynthesizing module, configured to synthesize the sound from the anotherterminal with the animation stored in the storage module, so as to formanalog image data; and a playing module, configured to play the analogimage data.

Compared with the existing technology, the present disclosure has thefollowing exemplary beneficial effects.

(1) The present disclosure provides an entirely inventive communicationmedium, which uses an expression animation in combination with apersonalized sound, and is shown in the form of an analog image byintelligent processing, which can vividly and correctly reflect a stateand mood of a user during communication.

(2) The present disclosure automatically matches a sound recorded by theuser with an image selected by the user to form analog image data, whichsimplifies user operations.

(3) The present disclosure can transmit a sound that has gone throughsound change processing only in a network, and then correspondinglymatch a received sound with a local animation, which greatly savestraffic during network transmission, shortens a sending response time,and improves communication efficiency.

Certainly, any product for implementing the present disclosure does notnecessarily achieve all the advantages described above at the same time.

The foregoing description merely briefly describes the technicalsolution of the present disclosure. To more clearly understand thetechnical means of the present disclosure, the technical means of thepresent disclosure can be implemented according to content of thepresent disclosure. Moreover, to make the foregoing and otherobjectives, characteristics, and advantages of the present disclosuremore comprehensible, preferred embodiments are described in detail belowwith reference to the accompanying drawings. Other aspects orembodiments of the present disclosure can be understood by those skilledin the art in light of the description, the claims, and the drawings ofthe present disclosure.

BRIEF DESCRIPTION OF THE DRAWINGS

The following drawings are merely examples for illustrative purposesaccording to various disclosed embodiments and are not intended to limitthe scope of the present disclosure. The embodiments of the presentinvention are described below with reference to the accompanyingdrawings. In these accompanying drawings:

FIG. 1 is a flowchart of an instant messaging method according to anembodiment of the present invention;

FIG. 2 is a schematic diagram of a process of forming analog image dataaccording to an embodiment of the present invention;

FIG. 3 is a flowchart of another instant messaging method according toan embodiment of the present invention;

FIG. 4 is a flowchart of a communication information processing methodaccording to an embodiment of the present invention;

FIG. 5 is a flowchart of another communication information processingmethod according to an embodiment of the present invention;

FIG. 6 is a structural diagram of an instant messaging system accordingto an embodiment of the present invention;

FIG. 7 is a structural diagram of a terminal according to an embodimentof the present invention; and

FIG. 8 is a structural diagram of another terminal according to anembodiment of the present invention.

DESCRIPTION OF EMBODIMENTS

To further explain the technical means used in the present disclosurefor achieving the intended objectives and the efficacy thereof, specificimplementation manners, methods, steps, and efficacy of an instantmessaging method and system, a communication information processingmethod, a terminal, and a storage medium provided in the presentdisclosure are described in detail below with reference to theaccompanying drawings and preferred embodiments.

The foregoing and other technical content, features, and efficacy of thepresent disclosure will be clearly presented in the detailed descriptionof the preferred embodiments with reference to the drawings. Through thedescription of the specific implementation manners, the technical meansused in the present disclosure for achieving the intended objectives andthe efficacy can be understood more deeply and specifically. However,the accompanying drawings are just used for reference and descriptionand not intended to limit the present disclosure.

The present disclosure provides an entirely new communication medium,which uses an entirely new communication manner including an expressionanimation and also including a sound, and is shown in the form of ananalog image by some intelligent processing. Referring to FIG. 1, FIG. 1is a flowchart of an instant messaging method according to an embodimentof the present invention. The method includes the following steps:

S101: A first terminal receives a sound recorded by a user and performssound change processing on the sound recorded by the user to provide achanged sound.

The sound may be recorded by the user in real time by using amicrophone, or may also be prerecorded and then directly transmitted tothe first terminal. After the sound is recorded by the user, an audioprocessing library (Soundtouch) may be used to process the sound.Soundtouch mainly implements three functions of varying a rate, varyinga tone, and varying a rate and tone at the same time, can perform areal-time operation on a media stream, and can also perform an operationon an audio file. In the present disclosure, three parameters ofSoundtouch, such as, a rate, tempo, and pitch may be adjusted, therebychanging the sound of the user to simulate some special sounds. Forexample, if the parameter pitch is adjusted to 0.8, a low and slow soundof cattle and bears can be simulated; and if the parameter pitch isadjusted to 1.2, a loud sound of chicken can be simulated. The soundbecomes more personalized and vivid after the sound change processing,and the user can simulate a suitable sound according to a current moodor state, which makes a communication process more colorful rather thanboring.

S102: The first terminal sends the changed sound that has gone throughthe sound change processing to a second terminal.

The present disclosure is a communication manner of combining the soundand the animation. However, if the first terminal directly sendscomplete animation motions and sounds to the second terminal, trafficmay be wasted and a long sending response time may also be needed.Therefore, in this embodiment, it is just required to send the changedsound after the sound change processing to the other party, and theother party performs a corresponding match between the received soundand a local animation to locally form a complete analog image of theanimation motions plus sounds. This technical logic solves the problemthat a longer time and more traffic are needed during sending of thecomplete animations plus sounds.

S103: The second terminal synthesizes the received, changed sound thathas gone through the sound change processing with a pre-storedanimation, so as to form analog image data.

There may be multiple animations pre-stored by the second terminal, andthe animation synthesized with the sound may be randomly selected, ormay also be selected by the user. The synthesized sound and animationhave consistency, so that the current mood of the user can be vividlyexpressed. Therefore, a volume or tone of the sound can be used as areference to find a suitable animation by matching. For example, whenthe sound volume of the user is large and the tone is high, an animationwith an exaggerated motion can be selected; and when the sound volume ofthe user is small and the tone is low, an animation consistent with thisstyle can be selected, so as to simulate a real scenario duringcommunication as closely as possible.

During specific implementation, a matching rule may be defined first: itis assumed that one of the animations can be divided into N motionpictures and a motion style of each motion picture is different fromothers; if a corresponding animation is selected according to thevolume, a volume upper limit Volmax can be set; in addition, volumesfrom 0 to Volmax are divided into N sections with the same number as themotion pictures and are numbered from 1 to N, and then the N motionpictures are each made to correspond to a different volume, where amotion picture with a smaller motion range corresponds to a smallervolume.

When the sound that has gone through the sound change processing (i.e.,the changed sound) is matched with a suitable motion picture, the soundthat has gone through the sound change processing can be divided intomultiple time periods, and an average value of volumes during each timeperiod is sampled. Each average value of volumes may be equal or theclosest to a volume point in the foregoing defined rule, and in thisway, a motion picture corresponding to each average value of volumes canbe found. After all motion pictures corresponding to average values ofvolumes during each time period are found, these motion pictures can becombined into a continuous animation, and the motion style of thisanimation coincides with a volume of the sound that has gone through thesound change processing.

At last, the sound that has gone through the sound change processing isincorporated with the animation to form the analog image data. In thisway, animation effects and sound effects contained in the obtainedanalog image data can vividly and correctly reflect the state and moodof the user. As shown in FIG. 2, FIG. 2 is a schematic diagram of aprocess, of forming analog image data, described above, where n is thenumber of the time periods obtained by dividing the sound that has gonethrough the sound change processing.

S104: The second terminal plays the analog image data.

A process of playing the analog image data is a process of playing thesound that has gone through the sound change processing and thecorresponding animation at the same time. Different from an existingcommunication manner, the present disclosure vividly presents a realstate of a user performing communication. However, a speed and tone of asound of each person are different from others. Therefore,synchronization and a match between a sound and a motion need to beachieved when the analog image data is played. For example, a change ina mouth shape is basically synchronous with the speaking speed of a userwhen the user speaks, and the speed of a body motion also needs to besynchronous with and match the speaking speed.

During specific implementation, when the sound that has gone through thesound change processing is played, an average value of volumes or tonesduring each time period may be sampled at the same time, and then acorresponding motion picture is synchronously displayed according to thesampled average value of volumes or tones during each time period. Inthis way, a match between the sound and the motion may be achieved.

Besides, in start and end phases of playing the analog image data, somefixed animation motions can be added. These motions are irrelevant tothe sound of the user and are just animation motions that fixedly existto make the animation interesting.

Referring to FIG. 3, FIG. 3 is a flowchart of another instant messagingmethod according to an embodiment of the present invention. The methodincludes the following steps:

S301: A first terminal receives a sound recorded by a user and performssound change processing on the sound recorded by the user to provide achanged sound.

S302: The first terminal receives a role image identification selectedby the user.

S303: The first terminal sends the sound that has gone through the soundchange processing and the role image identification to a secondterminal.

S304: The second terminal synthesizes the sound that has gone throughthe sound change processing with a pre-stored animation that correspondsto the role image identification, so as to form analog image data.

S305: The second terminal plays the analog image data.

Embodiment 2 differs from Embodiment 1 in that: the user can select ananimation image that he wants to show to the other party by using a roleimage identification, where each role image identification correspondsto one pre-stored animation image, which also makes communicationcontent more colorful.

In the communication methods in embodiment 1 and embodiment 2, a firstterminal sends communication information to a second terminal and thesecond terminal plays the communication information, so as to describeconstitution of the communication information and a communicationmanner. However, the communication manner is not limited hereto, thatis, alternatively, the second terminal may also send communicationinformation to the first terminal and the first terminal plays thecommunication information.

The present disclosure further provides a communication informationprocessing method, which is a method for processing communicationinformation formed by combining animation effects and sound effects.Referring to FIG. 4, the method includes the following steps:

S401: Receive a sound that has gone through sound change processing froma to-be-communicated party.

The to-be-communicated party described herein may be a user terminal ora communication platform. The sound that has gone through the soundchange processing may be recorded by the user in real time by using amicrophone or may be prerecorded. The sound may be processed by using anaudio processing library (Soundtouch), and the sound of the user may bechanged by adjusting three parameters of Soundtouch, such as, a rate,tempo, and pitch to simulate some special sounds. For example, if theparameter pitch is adjusted to 0.8, a low and slow sound of cattle andbears can be simulated; and if the parameter pitch is adjusted to 1.2, aloud sound of chicken can be simulated. The sound may become morepersonalized and vivid after the sound change processing, and the usercan simulate a suitable sound according to a current mood or state,which makes a communication process more colorful rather than boring.

The present disclosure is a communication manner of combining the soundand the animation. However, if the to-be-communicated party directlysends complete animation motions and sounds, more traffic may be wastedand a long sending response time may also be needed. Therefore, in thisembodiment, it is just required to receive the sound and then perform acorresponding match between the received sound and a local animation tolocally form a complete analog image of the animation motions plussounds. This technical logic solves the problem that a longer time andmore traffic are needed during sending of the complete animations plussounds.

S402: Synthesize the sound with a pre-stored animation, so as to formanalog image data.

There may be multiple pre-stored animations described herein, and theanimation synthesized with the sound may be randomly selected from themultiple animations, or may also be selected by the user. Thesynthesized sound and animation have consistency, so that the currentmood of the user can be vividly expressed. Therefore, a volume or toneof the sound can be used as a reference to find a suitable animation bymatching, so as to simulate a real scenario during communication asclosely as possible.

During specific implementation, a matching rule may be defined first: itis assumed that one of the animations can be divided into N motionpictures and a motion style of each motion picture is different fromothers; if a corresponding animation is selected according to thevolume, a volume upper limit Volmax can be set; in addition, volumesfrom 0 to Volmax are divided into N sections with the same number as themotion pictures and are numbered from 1 to N, and then the N motionpictures are each made to correspond to a different volume, where amotion picture with a smaller motion range corresponds to a smallervolume.

When the sound that has gone through the sound change processing ismatched with a suitable motion picture, the sound that has gone throughthe sound change processing can be divided into multiple time periods,and an average value of volumes during each time period is sampled. Eachaverage value of volumes may be equal or the closest to a volume pointin the foregoing defined rule, and in this way, a motion picturecorresponding to each average value of volumes can be found. After allmotion pictures corresponding to average values of volumes during eachtime period are found, these motion pictures can be combined into acontinuous animation, and the motion style of this animation coincideswith a volume of the sound that has gone through the sound changeprocessing. At last, the sound that has gone through the sound changeprocessing is incorporated with the animation to form the analog imagedata. In this way, animation effects and sound effects contained in theobtained analog image data can vividly and correctly reflect the stateand mood of the user.

S403: Play the analog image data.

A process of playing the analog image data is a process of playing thesound that has gone through the sound change processing and thecorresponding animation at the same time. Different from an existingcommunication manner, the present disclosure vividly presents a realstate of a user. However, a speed and tone of a sound of each person aredifferent from others. Therefore, synchronization and a match between asound and a motion need to be achieved when the analog image data isplayed. For example, a change in a mouth shape is basically synchronouswith the speaking speed of a user when the user speaks, and the speed ofa body motion also needs to be synchronous with and match the speakingspeed.

During specific implementation, when the sound that has gone through thesound change processing is played, an average value of volumes or tonesduring each time period may be sampled at the same time, and then acorresponding motion picture is synchronously displayed according to thesampled average value of volumes or tones during each time period. Inthis way, a match between the sound and the motion may be achieved.

Besides, in start and end phases of playing the analog image data, somefixed animation motions can be added. These motions are irrelevant tothe sound of the user and are just animation motions that fixedly existto make the animation interesting.

Referring to FIG. 5, FIG. 5 is a flowchart of another communicationinformation processing method according to an embodiment of the presentinvention. The method includes the following steps:

S501: Receive a sound that has gone through sound change processing froma to-be-communicated party.

S502: Receive a role image identification sent from the communicationobject.

S503: Synthesize the sound with a pre-stored animation that correspondsto the role image identification, so as to form analog image data.

S504: Play the analog image data.

Embodiment 4 differs from Embodiment 3 in that: an animation image thata user wants to show can be selected by using a role imageidentification sent from a to-be-communicated party, where each roleimage identification corresponds to one pre-stored animation image,which also makes communication content more colorful.

The present disclosure further provides an instant massaging system.Referring to FIG. 6, the system includes at least two terminals 60 thatare connected to each other via a network, where the terminals 60 cancommunicate with each other in an analog image manner which combines ananimation and a sound effect.

Referring to FIG. 7, FIG. 7 is a structural diagram of a terminalaccording to an embodiment of the present invention. The terminalincludes a storage module 61, a sound recording module 62, a soundchange module 63, a communication module 64, a synthesizing module 65,and a playing module 66, where the sound change module 63 is connectedto the sound recording module 62, the communication module 64 isseparately connected to the sound change module 63 and the synthesizingmodule 65, and the storage module 61 and the playing module 66 are bothconnected to the synthesizing module 65.

When the terminals 60 communicate with each other (description is givenherein by using an example in which communication information istransferred between two terminals 60 once), a user records a sound byusing the sound recording module 62 first, where the sound recordingmodule 62 may be a microphone. Certainly, a user may also record a soundin advance (for example, some common chatting statements) and store thesound in a local database. In this way, the recorded sound can bedirectly selected and used, thereby improving communication efficiency.

After sound recording is complete, sound change processing is performedby using the sound change module 63. An audio processing library(Soundtouch) is preferably used to perform the sound change processing.Soundtouch mainly implements three functions of varying a rate, varyinga tone, and varying a rate and tone at the same time, can perform areal-time operation on a media stream, and can also perform an operationon an audio file. Three parameters of the Soundtouch, such as, a rate,tempo, and pitch may be adjusted, thereby changing the sound of the userto simulate some special sounds. For example, if the parameter pitch isadjusted to 0.8, a low and slow sound of cattle and bears can besimulated; and if the parameter pitch is adjusted to 1.2, a loud soundof chicken can be simulated. The user can simulate a suitable soundaccording to a current mood or state, so that the sound may become morepersonalized and vivid.

After the sound change is complete, the sound can be sent to anotherterminal 60 by using the communication module 64. After receiving thesound that has gone through the sound change processing, a communicationmodule 64 of the another terminal 60 transfers the sound to thesynthesizing module 65 for processing. The synthesizing module 65synthesizes the sound that has gone through the sound change processingwith an animation stored in the storage module 61, so as to form analogimage data. The synthesized sound and animation need to be consistentwith each other, and a volume or tone of the sound can be used as areference to find a suitable animation by matching. For example, whenthe sound volume of the user is large and the tone is high, an animationwith an exaggerated motion can be selected; and when the sound volume ofthe user is small and the tone is low, an animation consistent with thisstyle can be selected, so as to simulate a real scenario duringcommunication as closely as possible.

After synthesis is complete, the analog image data may be transferred tothe playing module 66 for playing. A process of playing the analog imagedata is a process of playing the sound that has gone through the soundchange processing and the corresponding animation at the same time. Sofar, one transmission of communication data is complete. Different froman existing communication manner, the present disclosure vividlypresents a real state of a user.

It should be noted that because data volume of animations is large,network traffic needed to transmit the animations is large, and a longsending response time is also needed, in this embodiment, afterrecording a sound, one terminal transmits the sound that has gonethrough sound change processing to the other terminal, and the otherterminal completes synthesis of sounds and animations. This preferreddata transmission manner also solves the problem that a longer time andmore traffic are needed during sending of the complete animations plussounds. However, the present disclosure is not limited thereto. In acase in which network bandwidth is ample, synthesis of sounds andanimations can be completed by one terminal, and then the analog imagedata is transmitted to the other terminal directly for playing. In thissituation, because the sound recording and the synthesis of sounds plusanimations are performed by one same terminal, matching sounds withanimations may be easier, and a dynamic animation (a Gif dynamicpicture) may even be used as a reference to find a suitable sound bymatching. There are multiple synthesis forms.

Additionally, in start and end phases of playing the analog image data,some fixed animation motions can be added. These motions are irrelevantto the sound of the user and are just animation motions that fixedlyexist to make the animation interesting.

Referring to FIG. 8, FIG. 8 is a structural diagram of a more detailedterminal according to an embodiment of the present invention. Comparedwith FIG. 7, an interaction module 67 is added to a terminal 60 in thisembodiment, where the interaction module 67 is separately connected tothe communication module 64 and the storage module 61 and is configuredto receive a role image identification selected by a user, where therole image identification corresponds to at least one of the animationsstored in the storage module 61. Animation images stored in the storagemodule 61 may be diversified. Therefore, in one terminal, the user mayselect an animation image that the user wants to show to the other partyby using the role image identification input by the interaction module67, and then send the role image identification together with the soundthat has gone through the sound change processing to the other terminalby using the communication module 64; the other terminal finds acorresponding animation thereof in the storage module 61 according tothe role image identification; and the synthesizing module 65synthesizes the sound and the animation to form analog image data.

The synthesizing module 65 in this embodiment includes a selection unit651, a separation unit 652, a first sampling unit 653, a matching unit654, and a combination unit 655, where the selection unit 651 isseparately connected to the communication module 64 and the storagemodule 61, the separation unit 652 is connected to the communicationmodule 64, the first sampling unit 653 is connected to the separationunit 652, the matching unit 654 is separately connected to the firstsampling unit 653 and the selection unit 651, and the combination unit655 is separately connected to the matching unit 654 and the playingmodule 66.

The synthesizing module 65 in this embodiment uses a volume or tone of asound as a reference to find a suitable animation by matching. Toachieve consistency between the sound and the animation, a matching rulemay be defined first: it is assumed that one of the animations may bedivided into N motion pictures and a motion style of each motion pictureis different from others; if a corresponding animation is selectedaccording to the volume, a volume upper limit Volmax may be set; inaddition, volumes from 0 to Volmax are divided into N sections with thesame number as the motion pictures and are numbered from 1 to N, andthen the N motion pictures are each made to correspond to a differentvolume, where a motion picture with a smaller motion range correspondsto a smaller volume.

During actual operations, the selection unit 651 first selects,according to the role image identification received by the communicationmodule 64, a picture group (e.g., an animation composed of N picturesand corresponding to the role image identification) from the storagemodule 61. in addition, the separation unit 652 divides the sound thathas gone through the sound change processing and is received by thecommunication module 64 into multiple time periods; and the firstsampling unit 653 samples an average value of volumes during each timeperiod, where each average value of volumes may be equal or the closestto a volume point in the foregoing defined rule. The matching unit 654acquires, according to the average value of volumes or tones during eachtime period, a motion picture corresponding to each time period in thepicture group selected by the selection unit 651. After all motionpictures corresponding to average values of volumes during each timeperiod are found, the combination unit 655 combines these motionpictures into a continuous animation, where the motion style of thisanimation coincides with a volume of the sound that has gone through thesound change processing. At last, the combination unit 655 incorporatesthe sound that has gone through the sound change processing with theanimation to form the analog image data. In this way, animation effectsand sound effects contained in the obtained analog image data canvividly and correctly reflect the state and mood of the user.

A speed and tone of a sound of each person are different from others.Therefore, synchronization and a match between a sound and a motion needto be achieved when the analog image data is played. For example, achange in a mouth shape is basically synchronous with the speaking speedof a user when the user speaks, and the speed of a body motion alsoneeds to be synchronous with and match the speaking speed. Therefore,the playing module 66 in this embodiment further includes a soundplaying unit 661, a second sampling unit 662, a picture extracting unit663, and a display unit 664, where the sound playing unit 661 isconnected to the combination unit 655, the second sampling unit 662 isconnected to the sound playing unit 661, the picture extracting unit 663is separately connected to the combination unit 655 and the secondsampling unit 662, and the display unit 664 is connected to the pictureextracting unit 663.

During playing, the sound playing unit 661 plays the sound that has gonethrough the sound change processing, and at the same time, the secondsampling unit 662 samples an average value of volumes or tones duringeach time period. Then, the picture extracting unit 663 extracts acorresponding motion picture according to the average value of volumesor tones during each time period sampled by the second sampling unit662, and the display unit 664 displays the motion picture synchronously.In this way, consistency and a match between an animation motion and asound can be achieved.

The present disclosure further provides a terminal that has the samefunctions and working principles as the terminal structure described inthe embodiments shown in FIG. 7 and FIG. 8, which is not describedherein again.

The present disclosure further provides one or more storage mediacontaining a computer executable instruction, the computer executableinstruction being used for executing an instant messaging method, andthe method including the following steps: receiving, by a firstterminal, a sound recorded by a user, and performing sound changeprocessing on the sound recorded by the user; sending, by the firstterminal, the sound to a second terminal; synthesizing, by the secondterminal, the received sound with a pre-stored animation, so as to formanalog image data; and playing, by the second terminal, the analog imagedata.

The present disclosure further provides one or more storage mediacontaining a computer executable instruction, the computer executableinstruction being used for executing a communication informationprocessing method, and the method including the following steps:receiving a changed sound after sound change processing from anto-be-communicated party; synthesizing the changed sound with apre-stored animation, so as to form analog image data; and playing theanalog image data.

The present disclosure provides an entirely new communication medium,which uses an entirely new communication manner including an expressionanimation and also including a personalized sound, and is shown in theform of an analog image by intelligent processing, which can vividly andcorrectly reflect a state and mood of a user.

The present disclosure automatically matches a sound recorded by theuser with an image selected by the user to form analog image data, whichsimplifies user operations. The present disclosure can transmit a soundthat has gone through sound change processing only in a network, andthen correspondingly match a received sound with a local animation,which greatly saves traffic during network transmission, shortens asending response time, and improves communication efficiency.

The above descriptions are merely preferred embodiments of the presentinvention, and are not intended to limit the present disclosure in anyform. Although the present disclosure has been disclosed above throughthe preferred embodiments, the embodiments are not intended to limit thepresent disclosure. A person skilled in the art can make some equivalentvariations, alterations or modifications to the above-disclosedtechnical content without departing from the scope of the technicalsolutions of the present disclosure to obtain equivalent embodiments.Any simple alteration, equivalent change or modification made to theabove embodiments according to the technical essence of the presentdisclosure without departing from the content of the technical solutionsof the present disclosure shall fall within the scope of the technicalsolutions of the present disclosure.

What is claimed is:
 1. An instant messaging method, comprising:receiving, by a first terminal, a sound recorded by a user; performing,by the first terminal, sound change processing on the sound recorded bythe user to provide a changed sound; and sending, by the first terminal,the changed sound after the sound change processing to a second terminalfor the second terminal to synthesize the changed sound with apre-stored animation in the second terminal to form analog image dataand to play the analog image data.
 2. The instant messaging methodaccording to claim 1, wherein the step of receiving, by the firstterminal, the sound recorded by the user, and performing sound changeprocessing on the sound recorded by the user comprises: performing, byusing an audio processing library, the sound change processing on thesound recorded by the user.
 3. The instant messaging method according toclaim 1, wherein: before the step of sending, by the first terminal, thechanged sound to the second terminal, the method further comprises:receiving, by the first terminal, a role image identification selectedby the user; and the step of sending, by the first terminal, the changedsound to the second terminal comprises: sending, by the first terminal,the changed sound and the role image identification to the secondterminal for the second terminal to synthesize the changed sound with apre-stored animation that corresponds to the role image identification,so as to form the analog image data.
 4. The instant messaging methodaccording to claim 3, wherein the second terminal synthesizes thechanged sound with the pre-stored animation that corresponds to the roleimage identification, so as to form the analog image data by: selecting,according to the role image identification, a pre-stored picture group;dividing the changed sound into multiple time periods, and sampling anaverage value of volumes or tones during each time period; acquiring,according to the average value of volumes or tones during each timeperiod, a motion picture corresponding to each time period in thepicture group; and combining the motion pictures into a continuousanimation, and incorporating the sound with the animation, so as to formthe analog image data.
 5. The instant messaging method according toclaim 4, wherein the second terminal plays the analog image data by:playing the changed sound after the sound change processing, andsampling an average value of volumes or tones during each time period atthe same time; and synchronously displaying a corresponding motionpicture according to the sampled average value of volumes or tonesduring each time period.
 6. A communication information processingmethod, comprising: receiving a changed sound after a sound changeprocessing from a to-be-communicated party; synthesizing the changedsound with a pre-stored animation, so as to form analog image data; andplaying the analog image data.
 7. The communication informationprocessing method according to claim 6, wherein the sound changeprocessing is performed on a sound by using an audio processing libraryto provide the changed sound.
 8. The communication informationprocessing method according to claim 6, wherein before the step ofsynthesizing the changed sound with a pre-stored animation, so as toform the analog image data, the method further comprises: receiving arole image identification from the to-be-communicated party; and thestep of synthesizing the changed sound with the pre-stored animation, soas to form the analog image data comprises: synthesizing the changedsound with a pre-stored animation that corresponds to the role imageidentification, so as to form the analog image data.
 9. Thecommunication information processing method according to claim 8,wherein the step of synthesizing the changed sound with a pre-storedanimation that corresponds to the role image identification, so as toform the analog image data comprises: selecting a pre-stored picturegroup according to the role image identification; dividing the changedsound into multiple time periods, and sampling an average value ofvolumes or tones during each time period; acquiring, according to theaverage value of volumes or tones during each time period, a motionpicture corresponding to each time period in the picture group; andcombining the motion pictures into a continuous animation, andincorporating the sound with the animation, so as to form the analogimage data.
 10. The communication information processing methodaccording to claim 9, wherein the step of playing the analog image datacomprises: playing the changed sound, and sampling an average value ofvolumes or tones during each time period at the same time; andsynchronously displaying a corresponding motion picture according to thesampled average value of volumes or tones during each time period. 11.An instant messaging system, comprising: at least two terminalsconnected to each other via a network, each terminal comprising: astorage module, configured to store multiple animations; a soundrecording module, configured to receive a sound recorded by a user; asound change module, configured to perform sound change processing onthe sound recorded by the user to provide a changed sound; acommunication module, configured to send the changed sound after thesound change processing to another terminal or receive a changed soundfrom another terminal; a synthesizing module, configured to synthesizethe changed sound from the another terminal with the multiple animationsstored in the storage module, so as to form analog image data; and aplayingplay module, configured to play the analog image data.
 12. Theinstant messaging system according to claim 11, wherein the sound changemodule uses an audio processing library to perform the sound changeprocessing on the sound recorded by the user.
 13. The instant messagingsystem according to claim 11, wherein each module further comprises: aninteraction module, configured to receive a role image identificationselected by the user, the role image identification being correspondingto at least one of the multiple animations stored in the storage module,wherein: the communication module configured to send, when communicatingwith the another terminal, the role image identification together withthe changed sound to the another terminal, or receives a role imageidentification and the changed sound from the another terminal; and thesynthesizing module configured to synthesize, during forming of theanalog image data, the changed sound from the another terminal with theanimation that is stored in the storage module and corresponds to therole image identification received from the another terminal, so as toform the analog image data.
 14. The instant messaging system accordingto claim 13, wherein the synthesizing module comprises: a selectionunit, configured to select a corresponding picture group from thestorage module according to the role image identification received fromthe another terminal; a separation unit, configured to divide thechanged sound that has gone through the sound change processing and isreceived from the another terminal into multiple time periods; a firstsampling unit, configured to sample an average value of volumes or tonesduring each time period; a matching unit, configured to acquire,according to the average value of volumes or tones during each timeperiod, a motion picture corresponding to each time period in thepicture group; and a combination unit, configured to combine the motionpictures into a continuous animation and incorporate the sound with theanimation to form the analog image data.
 15. The instant messagingsystem according to claim 14, wherein the display module furthercomprises: a sound playing unit, configured to play the changed soundthat has gone through the sound change processing and is received fromthe another terminal; a second sampling unit, configured to sample anaverage value of volumes or tones during each time period at the sametime the sound playing unit plays the sound; a picture extracting unit,configured to synchronously extract, according to the sampled averagevalue of volumes or tones during each time period, a correspondingmotion picture; and a display unit, configured to synchronously displaythe motion picture extracted by the picture extracting unit.